<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html
     PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
     "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
<head>
<title>LingPipe: Citations and References</title>
<meta http-equiv="Content-type"
      content="application/xhtml+xml; charset=UTF-8"/>
<meta http-equiv="Content-Language"
      content="en"/>
<link href="css/lp-site.css"
      title="lp-site"
      type="text/css"
      rel="stylesheet"
      media="screen,projection,tv" />
<link href="css/lp-site-print.css"
      title="lp-site-print"
      type="text/css"
      rel="stylesheet"
      media="print,handheld,tty,aural,braille,embossed"/>
</head>

<body>

<div id="header">
<h1 id="product">LingPipe</h1><h1 id="pagetitle">Citations</h1>
<a id="logo"
   href="http://alias-i.com/"
  ><img src="img/logo-small.gif" alt="alias-i logo"/>
</a>
</div><!-- head -->


<div id="navig">

<!-- set class="current" for current link -->
<ul>
<li><a href="../index.html">home</a></li>

<li><a href="demos.html">demos</a></li>

<li><a href="licensing.html">license</a></li>

<li>download
<ul>
<li><a href="download.html">lingpipe core</a></li>
<li><a href="models.html">models</a></li>
</ul>
</li>

<li>docs
<ul>
<li><a href="install.html">install</a></li>
<li><a href="../demos/tutorial/read-me.html">tutorials</a></li>
<li><a href="../docs/api/index.html">javadoc</a></li>
<li><a href="book.html">textbook</a></li>
</ul>
</li>

<li>community
<ul>
<li><a href="customers.html">customers</a></li>
<li><a href="http://groups.yahoo.com/group/LingPipe/">newsgroup</a></li>
<li><a href="http://lingpipe-blog.com/">blog</a></li>
<li><a href="bugs.html">bugs</a></li>
<li><a href="sandbox.html">sandbox</a></li>
<li><a href="competition.html">competition</a></li>
<li><a class="current" href="citations.html">citations</a></li>
</ul>
</li>

<li><a href="contact.html">contact</a></li>

<li><a href="about.html">about alias-i</a></li>
</ul>

<div class="search">
<form action="http://www.google.com/search">
<p>
<input type="hidden" name="hl" value="en" />
<input type="hidden" name="ie" value="UTF-8" />
<input type="hidden" name="oe" value="UTF-8" />
<input type="hidden" name="sitesearch" value="alias-i.com" />
<input class="query" size="10%" name="q" value="" />
<br />
<input class="submit" type="submit" value="search" name="submit" />
<span style="font-size:.6em; color:#888">by&nbsp;Google</span>
</p>
</form>
</div>

</div><!-- navig -->


<div id="content" class="content">

<h2>Citing LingPipe</h2>

<p>If you want to cite the LingPipe software, we suggest following the
<a href="http://www.chicagomanualofstyle.org/"><i>Chicago Manual of Style</i></a>'s
<a
href="http://library.osu.edu/sites/guides/chicagogd.php">
guideline 17.356</a> for citing web sites in scientific articles.  For
the bibliography, it suggests the following form:</p>

<ul class="bib">
<li>
Alias-i. 2008.  LingPipe&nbsp;4.1.0.  http://alias-i.com/lingpipe <span style="color:#555">(accessed October 1, 2008)</span>
</li>
</ul>

<p>For inline citations, that would be:
</p>

<ul>
<li>
(Alias-i 2008)
</li>
</ul>

<!--
<h2>Presentations Mentioning LingPipe</h2>
<p>
&nbsp;
</p>
-->

<h2>Papers Mentioning LingPipe</h2>

<p>
We list papers that we wrote, as well as papers written by others.
</p>


<h3>Papers from Alias-i</h3>

<p>We've spent much more time writing code, javadoc and tutorials than papers,
but we have produced a few to go along with workshops or bakeoffs.
</p>

<ul class="bib">

<li>
Carpenter, Bob.  2007. LingPipe for 99.99% Recall of Gene Mentions. <i>Proceedings of the 2nd BioCreative workshop</i>. Valencia, Spain.
<a href="http://www.colloquial.com/carp/Publications/biocreative-8-alias-i.pdf">[pdf]</a>
</li>

<li>
Carpenter, Bob.  2006. Character language models for Chinese word segmentation and named entity recognition. Proceedings of the <i>5th ACL Chinese Special Interest Group (SIGHan)</i>. Sydney, Australia.
<a href="http://www.colloquial.com/carp/Publications/alias-i-sighan06.pdf">[pdf]</a>
</li>

<li>
Carpenter, Bob. 2005. Scaling High-Order Character Language Models to Gigabytes. In <i>Proceedings of the Association for Computational Linguistics Workshop on Software</i>. Ann Arbor.
<a href="http://www.colloquial.com/carp/Publications/acl05soft-carpenter.pdf">[pdf]</a>
</li>

<li>
Carpenter, Bob. 2004. Phrasal Queries with LingPipe and Lucene. In <i>Proceedings of the 13th Meeting of the Text Retrieval Conference (TREC)</i>. Gaithersburg, Maryland.
<a href="http://www.colloquial.com/carp/Publications/TREC2004.pdf">[pdf]</a>
</li>

<li>
Carpenter, Bob. 2004. Orthographic variation with Lucene. In O. Gospodnetic and E. Hatcher, <a href="http://www.manning.com/hatcher2/"><i>Lucene in Action</i></a>. Manning Press.
</li>

</ul>



<h3>Third-Party Papers</h3>

<p> If we missed your paper and you'd like to see it in this list,
please drop us a line at <a
href="mailto:lingpipe@alias-i.com"><code>lingpipe@alias-i.com</code></a>.
We're spending some time every release going through <a
href="http://scholar.google.com/scholar?q=lingpipe">Google Scholar</a>, but
we've only considered 100 or so of the several hundred results
presented (450 as of this release).
</p>


<ul class="bib">

<li>
Beneti, Aspasia, Woiyl Hammoumi, Eric Hielscher, Martin Müller, and David Persons.
2006.
Automatic generation of fine-grained named entity classifications.
Technical report, University of Amsterdam.
<a href="http://ifarm.nl/erikt/ltp2006/ltp2006.pdf">[pdf]</a>
</li>

<li>
Bey, Youcef, Christian Boitet, and Kyo Kageura.
2006.
The TRANSBey Prototype: An Online Collaborative Wiki-Based CAT Environment for Volunteer Translators.
In <i>Proceedings of LREC</i>.
<a href="http://www.mt-archive.info/LREC-2006-Bey.pdf">[pdf]</a>
</li>

<li>
Bischoff, Kerstin,  Thomas Mandl and Christa Womser-Hacker.
2007.
Blind Relevance Feedback and Named Entity Based Query Expansion for Geographic Retrieval at GeoCLEF 2006.
In <i>  Evaluation of Multilingual and Multi-modal Information Retrieval, CLEF 2007</i>.
Springer.
<a href="http://www.springerlink.com/content/j6p106053978226j/">[publisher link]</a>
</li>

<li>
Bradford, R. B.
2006.
Relationship Discovery in Large Text Collections Using Latent Semantic Indexing.
In <i>Proceedings of SDM 06</i>.
<a href="http://www.siam.org/meetings/sdm06/workproceed/Link%20Analysis/15.pdf">[pdf]</a>.
</li>

<li>
Buscaldi, Davide and Paolo Rosso.
2007.
On the Relative Importance of Toponyms in GeoCLEF.
In <i>Proceedings of CLEF 2007</i>.
<a href="http://www.dsic.upv.es/~prosso/resources/BuscaldiRosso_GeoCLEF07revised.pdf">[pdf]</a>
</li>

<li>
Chambers, Nate and Shan Wang. 2006.
Temporal Ordering of Event Descriptions.
CS 229 Class Project.  Stanford University.
<a href="http://www.stanford.edu/class/cs229/proj2006/ChambersShan-TemporalOrderingOfEventDescriptions.pdf">[pdf]</a>
</li>

<li>
Chen, Jiangping, He Ge, Y. Wu, and S. Jiang.
2004.
UNT at TREC 2004: Question Answering Combining Multiple Evidences.
<i>Text Retrieval Conference (TREC)</i>.
<a href="http://www-nlpir.nist.gov/trec/pubs/trec13/papers/unorthtexas.qa.pdf">[pdf]</a>
</li>

<li>
Chen, Jiangping, Ping Yu and He Ge.
2005.
UNT 2005 TREC QA Participation: Using Lemur as IR Search Engine.
In <i>Proceedings of TREC 2005</i>.
<a href="http://www-nlpir.nist.gov/trec/pubs/trec14/papers/unorth-texas.qa.pdf">[pdf]</a>
</li>

<li>
Clarke, James and Mirella Lapata.
2007.
Modelling Compression with Discourse Constraints.
In <i>Proceedings of EMNLP/CoNLL 2007</i>.
<a href="http://acl.ldc.upenn.edu/D/D07/D07-1001.pdf">[pdf]</a>
</li>

<li>
Corbett, Peter, Colin Batchelor, and Simone Teufel.  2007.
Annotation of chemical named entities.
In <i>Proceedings of BioNLP 2007</i>, 57-64, Prague.
<a href="http://acl.ldc.upenn.edu/W/W07/W07-10.pdf">[pdf]</a>
</li>

<li>
D'Avanzo, Ernesto and Bernardo Magnini.
2005.
A Keyphrase-Based Approach to Summarization:
the LAKE System at DUC-2005.
In <i>Document Understanding Conference</i>.
<a href="http://tides.nist.gov/pubs/2005papers/itc-irst.ernesto.pdf">[pdf]</a>
</li>

<li>
Dale, Robert and Pawe&#x142; Mazur. 2007.
The Semantics of Temporal Expressions.
In <i>Proceedings of the Twentieth Australian Joint Conference on Artificial Intelligence</i>.
435-444.
Gold Coast, Queensland, Australia.
<a href="http://www.ics.mq.edu.au/~rdale/publications/papers/2007/48300435.pdf">[pdf]</a>
</li>

<li>
Damianos, Laurie, Jay Ponte, Steve Wohlever,
Florence Reeder, David Day, George Wilson, and Lynette Hirschman.
2002.
MiTAP for Bio-Security: A Case Study.
<i>AI Magazine</i> <b>23</b>(4):13-29.
<a href="http://www.aaai.org/ojs/index.php/aimagazine/article/viewArticle/1666">[pdf]</a>
</li>

<li>
Denecke, K.
2008.
Using SentiWordNet for multilingual sentiment analysis.
In <i>IEEE 24th International Conference on Data Engineering Workshop (ICDEW)</i>.
507-512.
<a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4498370">[publisher site]</a>
</li>

<li>
Deschacht, K., M. F. Moens, and W. Robeyns.
2007.
Crossmedia entity recognition in nearly parallel visual and textual documents.
<i>8th RIAO Conference on Large-Scale Semantic Access.</i>
</li>

<li>
Duong, Deborah, Ben Goertzel, Jim Venuto, Ryan Richardson, Shawn Bohner, and Edward Fox.
2006.
Support Vector Machines to Weight Voters in a Voting System of
Entity Extractors.
In <i>Proceedings of the International Joint Conference on Neural Networks (IJCNN)</i>.
<a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1716242">[publisher site]</a>
</li>

<li>
Favre, Benoît, Frédéric Béchet, and Pascal Nocéra.
2005.
Robust Named Entity extraction from large spoken archives.
In <i>Proceedings of HLT/EMNLP</i>.  491-498.  Vancouver.
<a href="http://acl.ldc.upenn.edu/H/H05/H05-1062.pdf">[pdf]</a>
</li>

<li>
Gasperin, Caroline.
2006.
Semi-supervised anaphora resolution in biomedical texts.
In <i>Proceedings of BioNLP Workshop on Linking Natural Language Processing
and Biology at HLT-NAACL</i>.  96-103.
New York City.
<a href="http://acl.ldc.upenn.edu/W/W06/W06-3316.pdf">[pdf]</a>
</li>

<li>
Andogah, Geoffrey.  2007.
GIR [Geographic Information Retrieval] Experimentation.
In <i>Evaluation of Multilingual and Multi-modal Information Retrieval, CLEF 2006</i>.
Springer.
<a href="http://www.springerlink.com/content/8631l7357326r524/">[publisher link]</a>.
</li>

<li>
He, Ying and Mehmet Kayaalp.
2006.
A Comparison of 13 Tokenizers on MEDLINE.
Lister Hill National Center for Biomedical Communications Technical Report LHNCBC-TR-2006-003.
<a href="http://www.lhncbc.nlm.nih.gov/lhc/docs/reports/2006/tr2006003.pdf">[pdf]</a>
</li>

<li>
Iftene, Adrian and  Alexandra Balahur-Dobrescu.
2008.
Answer Validation on English and Romanian Languages.
In <i>Proceedings of CLEF 2008</i>.
<a href="http://www.clef-campaign.org/2008/working_notes/iftene_paperCLEF2008_AVE.pdf">[pdf]</a>
</li>

<li>
Iftene, Adrian and  Alexandra Balahur-Dobrescu.
2007.
Hypothesis Transformation and Semantic Variability Rules Used in Recognizing Textual Entailment.
In <i>Proceedings of the Association for Computational Linguistics (ACL)</i>.
<a href="http://acl.ldc.upenn.edu/W/W07/W07-1421.pdf">[pdf]</a>
</li>

<li>
Iftene, Adrian and  Alexandra Balahur-Dobrescu.
2007.
UAIC Participation at AVE 2007.
In <i>CLEF 2007, LNCS 5152</i>, 395-403.  Springer.
</li>

<li>
Kabiljo, Renata and Adrian J. Shepherd.
2008.
Protein Name Tagging in the Immunological Domain.
In <i>Proceedings of SMBM 2008</i>.
<a href="http://mars.cs.utu.fi/smbm2008/files/smbm2008proceedings/smbmpaper_22.pdf">[pdf]</a>
</li>

<li>
Kaljurand, Kaarel, Fabio Rinaldi, James Dowdall, and Michael Hess.
2004.
Exploiting Language Resources for Semantic Web Annotations.
In <i>Proceedings of LREC</i>.
<a href="http://serv1.ist.psu.edu:8080/showciting;jsessionid=F2EFFF954CAA7C0F78F3102808C61E62?cid=871649">[CiteSeer]</a>
</li>

<li>
Kashani, Mehdi M. and Fred Popowich.
2006.
Pronoun Generation for Text Summarization and Question Answering.
In <i>Proceedings of the 5th Slovenian and 1st International Language Technologies Conference</i>.
<a href="http://nl.ijs.si/is-ltc06/proc/16_Kashani.pdf">[pdf]</a>.
</li>

<li>
Leaman, Robert and Graciela Gonzalez.
2008.
Banner: an executable survey of advances in biomedical named entity recognition.
In <i>Proceedings of the Pacific Symposium on Biocomputing (PSB)</i> 13:652-663.
<a href="http://psb.stanford.edu/psb-online/proceedings/psb08/leaman.pdf">[pdf]</a>
</li>

<li>
Li, Yi,  Alistair Moffat, Nicola Stokes, and Lawrence Cavedon. 2006.
Exploring Probabilistic Toponym Resolution for
Geographical Information Retrieval.
In <i>3rd Workshop on Geographic Information Retrieval (GIR)</i>.
<a href="http://www.geo.unizh.ch/~rsp/gir06/papers/individual/li.pdf">[pdf]</a>
</li>


<li>
Mason, Joshua, Kathryn Watkins, Jason Eisner, and Adam Stubblefield.
2006.
A natural language approach to automated cryptanalysis of two-time pads.
In <i>Proceedings of the 13th ACM Conference on Computer and Communications Security</i>.
<a href="http://www.cs.jhu.edu/~jason/papers/mason+al.ccs06.pdf">[pdf]</a>
</li>

<li>
Mazur, Pawe&#x142; and Robert Dale.
2007.
The DANTE Temporal Expression Tagger.
In <i>Proceedings of the 3rd Language and Technology Conference</i>. Poznan, Poland.
<a href="http://www.ics.mq.edu.au/~rdale/publications/papers/2007/paper.pdf">[pdf]</a>
</li>

<li>
Mazur, Pawe&#x142; and Robert Dale.
2007.
A Rule Based Approach to Temporal Expression Tagging.
In <i>Proceedings of the International Multiconference on Computer Science and Information Technology (IMCSIT) 2nd International Symposium: Advances in Artificial Intelligence and Applications</i>. Wisla, Poland.
<a href="http://www.ics.mq.edu.au/~rdale/publications/papers/2007/cla07RD.pdf">[pdf]</a>
</li>

<li>
Melli, Gabor, Yang Wang, Yudong Liu, Mehdi M. Kashani, Zhongmin Shi,
Baohua Gu, Anoop Sarkar, and Fred Popowich. 2005.
Description of SQUASH, the SFU Question Answering Summary Handler for
the DUC-2005 Summarization Task.
In <i>Proceedings of the Document Understanding Conference (DUC)</i>.
</li>

<li>
Molla, Diego and Menno Van Zaanen.
2005.
Learning of graph rules for question answering.
In <i>Proceedings of ALTW</i>.
<a href="http://web.science.mq.edu.au/~diego/publications/altw05.pdf">[pdf]</a>
</li>

<li>
Neumann, Günter and Bogdan Sacaleanu.  2005.
Experiments on Robust NL Question Interpretation and Multi-layered Document Annotation for a CrossLanguage Question/Answering System.
In <i>Multilingual Information Access for Text, Speech and Images, CLEF 2004</i>.
Springer.
<a href="http://www.springerlink.com/content/a40av7eu42v4j5n0/">[publisher link]</a>.
</li>

<li>
Ofoghi, Bahadorreza, John Yearwood and Liping Ma.
2007.
The Impact of Semantic Class Identification and Semantic Role Labeling on Natural Language Answer Extraction.
In <i>Advances in Information Retrieval (ECIR) LNCS 4956</i>.  430-437.
Springer.
<a href="http://www.springerlink.com/content/d20t100882050h68/">[publisher link]</a>
</li>

<li>
Klinger, Roman, Corinna Kolárik, Juliane Fluck, Martin Hofmann-Apitius,
and Christoph M. Friedrich. 2008.  Detection of IUPAC and
IUPAC-like chemical names.  <i>Bioinformatics</i> 24(13):i268-i276.
</li>

<li>
Perea-Ortega, José M., Miguel Angel García Cumbreras,
Manuel García-Vega, and Luis Alfonso Ureña López.
2008.
SINAI-GIR System: University of Jaén at GeoCLEF 2008.
In <i>Proceedings of GeoCLEF 2008</i>.
<a href="http://clef.isti.cnr.it/2008/working_notes/Perea-Ortega-paperGeoCLEF2008.pdf">[pdf]</a>
</li>

<li>
Schilder, Frank, Andrew McCulloh, Bridget Thomson McInnes, and Alex Zhou.
2005.
TLR at DUC: Tree similarity.
<i>Proceedings of the Document Understanding Conference (DUC)</i>.
<a href="http://duc.nist.gov/pubs/2005papers/thomson-lr.schilder.pdf">[pdf]</a>
</li>

<li>
Strötgen, Robert, Thomas Mandl, and René Schneider. 2006.
A Fast Forward Approach to Cross-Lingual Question Answering for English and German.
In <i>Accessing Multilingual Information Repositories, CLEF 2005</i>.  Springer.
<a href="http://www.springerlink.com/content/m166w66n45l6v1u5/">[publisher link]</a>.
</li>

<li>
Li, Y., N. Stokes, L. Cavedon, and A. Moffat.
2006.
NICTA I2D2 Group at GeoCLEF 2006.
<i>Proceedings of CLEF</i>.
<a href="http://www.cs.mu.oz.au/~alistair/abstracts/lscm06geoclef.pdf">[pdf]</a>
</li>

<li>
Sureka, Ashish, Sudripto De, and Kishore Varma.
2008.
Mining Automotive Warranty Claims Data for Effective Root Cause Analysis.
In <i>Database Systems for Advanced Applications, LNCS 4947</i>. 621-626.  Springer.
<a href="http://www.springerlink.com/content/d1125563n12x31v6/">[publisher link]</a>
</li>

<li>
Tratz, Stephen, Antonio Sanfilippo, Michelle Gregory, Alan Chappell,
Christian Posse and Paul Whitney.
2007.
PNNL: A Supervised Maximum Entropy Approach to Word Sense
Disambiguation.
In <i>Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007)</i>.
264-267.  Prague.
<a href="http://acl.ldc.upenn.edu/W/W07/W07-2057.pdf">[pdf]</a>
</li>


<li> Vlachos, Andreas.  2006.  Active annotation.  In <i>Adaptive Text
Extraction and Mining (ATEM)</i>.  <a
href="http://acl.ldc.upenn.edu/eacl2006/ws06_atem.pdf">[pdf]</a>
</li>

<li>
Vlachos, Andreas and Caroline Gasperin.
2006.
Bootstrapping and Evaluating Named Entity Recognition in the Biomedical
Domain.
In <i>Proceedings of the BioNLP Workshop at HLT-NAACL</i>.
<a href="http://acl.ldc.upenn.edu/W/W06/W06-3328.pdf">[pdf]</a>.
</li>

<li>
Vlachos, Andreas, Caroline Gasperin, Ian Lewin, and Ted Briscoe.
2006.
Bootstrapping the Recognition and Anaphoric Linking of Named Entities in Drosophila
Articles.
In <i>Proceedings of the Pacific Symposium on Biocomputing 11</i>:100-111.
<a href="http://helix-web.stanford.edu/psb06/vlachos.pdf">[pdf]</a>
</li>

<li>
Vlachos, Andreas.
2007.
Evaluating and combining biomedical named entity recognition systems.
In <i>Proceedings of ACL Workshop</i>.
<a href="http://acl.ldc.upenn.edu/W/W07/W07-10.pdf">[pdf]</a>
</li>

<li>
Wang, Hudong, Shannon Bradshaw and Marc Light.
2005.
Automatic highlighting of bioscience literature.
In <i>Proceedings of BioLink</i>.
</li>

<li>
Zhu, Weizhong, Chaomei Chen, and Robert B. Allen.
2006.
Visualizing the Evolution of Social Networks.
Poster presented at <i>IST Research Day 2006</i>.  Drexel University.
<a href="http://dspace.library.drexel.edu/bitstream/1860/1596/1/2007021022.pdf">[pdf]</a>
</li>

</ul>


<h2>Patent Applications</h2>

<p>Yes, we've even been mentioned in third-party patent applications!  One doesn't
need to own all the intellectual property mentioned in a patent to get
a patent.</p>

<ul>
<li>
Frankie E. D. Patman and Charles Kinston Williams.
2007.
Filtering extracted personal names.
U.S. Patent Application 20070005578A1.
<a href="http://www.google.com/patents?id=vtyXAAAAEBAJ">[Google Patents]</a>
</li>
</ul>



<h2>Courses using LingPipe</h2>

<p>We know there are more out there, but these are the only syllabi
we could find online (search: <i>&lt;syllabus lingpipe site:.edu&gt;</i>).
Let us know if you are using us in your class, especially if you'd
like help.</p>

<ul>
<li>
William Lewis.  2008.
<i>Ling 570:
Shallow Processing Techniques for Natural Language Processing.</i>
University of Washington.
<a href="http://courses.washington.edu/ling570/will_fall08/570-syllabus.htm">[syllabus]</a>
</li>
</ul>



</div><!-- content -->



<div id="foot">
<p>
&#169; 2003&ndash;2011 &nbsp;
<a href="mailto:lingpipe@alias-i.com">alias-i</a>
</p>
</div>
<script type="text/javascript">
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
</script>
<script type="text/javascript">
try {
var pageTracker = _gat._getTracker("UA-15123726-1");
pageTracker._trackPageview();
} catch(err) {}</script></body>
</html>


